explosion's Repositories

78 repositories

.github
:octocat: GitHub settings
โญ 2 ๐ŸŒ Public
aiGrunn-2023
Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines
โญ 1 ๐ŸŒ Public
assets
๐Ÿ’ฅ Explosion Assets
โญ 45 ๐ŸŒ Public
blis
BLAS-like Library Instantiation Software Framework
โญ 1 ๐ŸŒ Public
catalogue
Super lightweight function registries for your library
โญ 180 ๐ŸŒ Public
confection
:candy: Confection: the sweetest config system for Python
โญ 192 ๐ŸŒ Public
conll-2012
A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.
โญ 9 ๐ŸŒ Public
curated-tokenizers
Lightweight piece tokenization library
โญ 12 ๐ŸŒ Public
curated-transformers
๐Ÿค– A PyTorch library of curated Transformer models and their composable components
โญ 894 ๐ŸŒ Public
curated-transformers-addons
Add-ons for Curated Transformers
โญ 0 ๐ŸŒ Public
cymem
๐Ÿ’ฅ Cython memory pool for RAII-style memory management
โญ 460 ๐ŸŒ Public
cython-blis
๐Ÿ’ฅ Fast matrix-multiplication as a self-contained Python library โ€“ no system dependencies!
โญ 229 ๐ŸŒ Public
displacy
:boom: displaCy.js: An open-source NLP visualiser for the modern web
โญ 345 ๐ŸŒ Public ๐Ÿ“ฆ Archived
displacy-ent
:boom: displaCy-ent.js: An open-source named entity visualiser for the modern web
โญ 199 ๐ŸŒ Public ๐Ÿ“ฆ Archived
ec2buildwheel
No description
โญ 3 ๐ŸŒ Public
fastapi-explosion-extras
No description
โญ 0 ๐ŸŒ Public
floret
๐ŸŒธ fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
โญ 329 ๐ŸŒ Public
gha-cibuildwheel
No description
โญ 0 ๐ŸŒ Public
healthsea
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
โญ 92 ๐ŸŒ Public
jupyterlab-prodigy
๐Ÿงฌ A JupyterLab extension for annotating data with Prodigy
โญ 189 ๐ŸŒ Public
lightnet
๐ŸŒ“ Bringing pjreddie's DarkNet out of the shadows #yolo
โญ 320 ๐ŸŒ Public ๐Ÿ“ฆ Archived
ml-datasets
๐ŸŒŠ Machine learning dataset loaders for testing and example scripts
โญ 47 ๐ŸŒ Public
murmurhash
๐Ÿ’ฅ Cython bindings for MurmurHash2
โญ 44 ๐ŸŒ Public
nginx_acm_ssl_proxy
Nginx container that allows for environmental variable use to set nginx configuration.
โญ 0 ๐ŸŒ Public
os-signpost
Wrapper for the macOS signpost API
โญ 16 ๐ŸŒ Public
preshed
๐Ÿ’ฅ Cython hash tables that assume keys are pre-hashed
โญ 87 ๐ŸŒ Public
princetondh
Code for our presentation in Princeton DH 2023 April.
โญ 4 ๐ŸŒ Public
prodigy-ann
A Prodigy pluging for ANN techniques
โญ 5 ๐ŸŒ Public
prodigy-evaluate
๐Ÿ”Ž A Prodigy plugin for evaluating spaCy pipelines
โญ 13 ๐ŸŒ Public
prodigy-hf
Train huggingface models on top of Prodigy annotations
โญ 21 ๐ŸŒ Public
prodigy-lunr
A Prodigy plugin for document search via LUNR
โญ 3 ๐ŸŒ Public
prodigy-openai-recipes
โœจ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
โญ 323 ๐ŸŒ Public ๐Ÿ“ฆ Archived
prodigy-pdf
A Prodigy plugin for PDF annotation
โญ 36 ๐ŸŒ Public
prodigy-recipes
๐Ÿณ Recipes for the Prodigy, our fully scriptable annotation tool
โญ 504 ๐ŸŒ Public
prodigy-segment
Select pixels in Prodigy via Facebook's Segment-Anything model.
โญ 10 ๐ŸŒ Public
prodigy-whisper
Audio transcription with OpenAI's whisper model in the loop.
โญ 5 ๐ŸŒ Public
projects
๐Ÿช End-to-end NLP workflows from prototype to production
โญ 1405 ๐ŸŒ Public
radicli
๐Ÿ•Š๏ธ Radically lightweight command-line interfaces
โญ 108 ๐ŸŒ Public
sense2vec
๐Ÿฆ† Contextually-keyed word vectors
โญ 1667 ๐ŸŒ Public
spaCy
๐Ÿ’ซ Industrial-strength Natural Language Processing (NLP) in Python
โญ 32986 ๐ŸŒ Public
spacy-alignments
๐Ÿ’ซ A spaCy package for Yohei Tamura's Rust tokenizations library
โญ 34 ๐ŸŒ Public
spacy-benchmarks
๐Ÿ’ซ Runtime performance comparison of spaCy against other NLP libraries
โญ 20 ๐ŸŒ Public ๐Ÿ“ฆ Archived
spacy-biaffine-parser
No description
โญ 1 ๐ŸŒ Public
spacy-course
๐Ÿ‘ฉโ€๐Ÿซ Advanced NLP with spaCy: A free online course
โญ 2395 ๐ŸŒ Public
spacy-curated-transformers
spaCy entry points for Curated Transformers
โญ 32 ๐ŸŒ Public
spacy-dev-resources
๐Ÿ’ซ Scripts, tools and resources for developing spaCy
โญ 126 ๐ŸŒ Public ๐Ÿ“ฆ Archived
spacy-experimental
๐Ÿงช Cutting-edge experimental spaCy components and features
โญ 105 ๐ŸŒ Public
spacy-huggingface-hub
๐Ÿค— Push your spaCy pipelines to the Hugging Face Hub
โญ 45 ๐ŸŒ Public
spacy-huggingface-pipelines
๐Ÿ’ฅ Use Hugging Face text and token classification pipelines directly in spaCy
โญ 63 ๐ŸŒ Public
spacy-io-binder
๐Ÿ“’ Repository used to build Binder images for the interactive spaCy code examples
โญ 1 ๐ŸŒ Public
spacy-layout
๐Ÿ“š Process PDFs, Word documents and more with spaCy
โญ 828 ๐ŸŒ Public
spacy-legacy
๐Ÿ•ธ๏ธ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
โญ 4 ๐ŸŒ Public
spacy-llm
๐Ÿฆ™ Integrating LLMs into structured NLP pipelines
โญ 1355 ๐ŸŒ Public
spacy-loggers
๐Ÿ“Ÿ Logging utilities for spaCy
โญ 12 ๐ŸŒ Public
spacy-lookups-data
๐Ÿ“‚ Additional lookup tables and data resources for spaCy
โญ 113 ๐ŸŒ Public
spacy-models
๐Ÿ’ซ Models for the spaCy Natural Language Processing (NLP) library
โญ 1825 ๐ŸŒ Public
spacy-notebooks
๐Ÿ’ซ Jupyter notebooks for spaCy examples and tutorials
โญ 288 ๐ŸŒ Public ๐Ÿ“ฆ Archived
spacy-pkuseg
pkusegๅคš้ข†ๅŸŸไธญๆ–‡ๅˆ†่ฏๅทฅๅ…ท; The pkuseg toolkit for multi-domain Chinese word segmentation
โญ 67 ๐ŸŒ Public
spacy-ray
โ˜„๏ธ Parallel and distributed training with spaCy and Ray
โญ 56 ๐ŸŒ Public
spacy-services
๐Ÿ’ซ REST microservices for various spaCy-related tasks
โญ 241 ๐ŸŒ Public ๐Ÿ“ฆ Archived
spacy-stanza
๐Ÿ’ฅ Use the latest Stanza (StanfordNLP) research models directly in spaCy
โญ 743 ๐ŸŒ Public
spacy-streamlit
๐Ÿ‘‘ spaCy building blocks and visualizers for Streamlit apps
โญ 849 ๐ŸŒ Public
spacy-transformers
๐Ÿ›ธ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
โญ 1402 ๐ŸŒ Public
spacy-vectors-builder
๐ŸŒธ Train floret vectors
โญ 18 ๐ŸŒ Public
spacy-vscode
spaCy extension for Visual Studio Code
โญ 31 ๐ŸŒ Public
spacymoji
๐Ÿ’™ Emoji handling and meta data for spaCy with custom extension attributes
โญ 183 ๐ŸŒ Public
span-labeling-datasets
Loaders for various span labeling datasets
โญ 2 ๐ŸŒ Public
srsly
๐Ÿฆ‰ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
โญ 480 ๐ŸŒ Public
talks
๐Ÿ’ฅ Browser-based slides or PDFs of our talks and presentations
โญ 94 ๐ŸŒ Public ๐Ÿ“ฆ Archived
thinc
๐Ÿ”ฎ A refreshing functional take on deep learning, compatible with your favorite libraries
โญ 2884 ๐ŸŒ Public
thinc-apple-ops
๐Ÿ Make Thinc faster on macOS by calling into Apple's native Accelerate library
โญ 103 ๐ŸŒ Public
thinc_gpu_ops
๐Ÿ”ฎ GPU kernels for Thinc
โญ 9 ๐ŸŒ Public ๐Ÿ“ฆ Archived
tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
โญ 193 ๐ŸŒ Public ๐Ÿ“ฆ Archived
vscode-prodigy
๐Ÿงฌ A VS Code extension for annotating data with Prodigy
โญ 30 ๐ŸŒ Public
wasabi
๐Ÿฃ A lightweight console printing and formatting toolkit
โญ 467 ๐ŸŒ Public
weasel
๐Ÿฆฆ weasel: A small and easy workflow system
โญ 88 ๐ŸŒ Public
wheelwright
๐ŸŽก Automated build repo for Python wheels and source packages
โญ 174 ๐ŸŒ Public
wikid
Generate a SQLite database from Wikipedia & Wikidata dumps.
โญ 35 ๐ŸŒ Public